AITopics | Amarillo

Collaborating Authors

Amarillo

Mysterious UFO-shaped 'Dorito' aircraft spotted over Area 51 as strange military code is heard

Daily Mail - Science & techJan-23-2026, 05:54:45 GMT

Trump orders a massive armada toward Iran with ominous warning about what could come next: 'We're watching' Mysterious UFO-shaped'Dorito' aircraft spotted over Area 51 as strange military code is heard Florida, Texas and California lead America's housing crash as other Sun Belt states start to crack as values plunge 7.6 percent Meghan Trainor's teary photo with her new baby born via surrogate has sparked an almost unsayable thought. Most women won't admit it... but I will: CAROLINE BULLOCK Billionaire who predicted 2008 crash issues stark warning over'worrying' new US trend but there's one way to protect your savings AND make money Canadian woman was euthanized'against her will' after husband was fed-up with caring for her Another awkward moment between Victoria Beckham and Nicola Peltz goes viral as fans claim Brooklyn's mum'is not the problem' Chilling video shows high school student rampaging through classroom with knife... before teacher steps in Trump describes excruciating ...

area 51, beckham, brooklyn beckham, (13 more...)

Daily Mail - Science & tech

Country:

Asia > Middle East > Iran (0.24)
North America > Canada > Alberta (0.14)
North America > United States > New York (0.04)
(22 more...)

Genre: Personal (0.46)

Industry:

Transportation > Air (1.00)
Media > Television (1.00)
Media > Music (1.00)
(8 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.68)

Add feedback

Testing GPT-4-o1-preview on math and science problems: A follow-up study

Davis, Ernest

arXiv.org Artificial IntelligenceOct-11-2024

In August 2023, Scott Aaronson and I reported the results of testing GPT4 with the Wolfram Alpha and Code Interpreter plug-ins over a collection of 105 original high-school level and college-level science and math problems (Davis and Aaronson, 2023). In September 2024, I tested the recently released model GPT-4o1-preview on the same collection. Overall I found that performance had significantly improved, but was still considerably short of perfect. In particular, problems that involve spatial reasoning are often stumbling blocks. On September 12, OpenAI (2024) released two preliminary versions, "ChatGPT-o1-preview" and "ChatGPT-o1-mini" of a forthcoming product "ChatGPT-o1".

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2410.2234

Country:

Europe > France (0.05)
North America > United States > California > San Francisco County > San Francisco (0.05)
North America > Canada > Quebec (0.05)
(10 more...)

Genre: Research Report (0.50)

Industry:

Education > Educational Setting (0.55)
Government > Space Agency (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generative Context-aware Fine-tuning of Self-supervised Speech Models

Shon, Suwon, Kim, Kwangyoun, Sridhar, Prashant, Hsu, Yi-Te, Watanabe, Shinji, Livescu, Karen

arXiv.org Artificial IntelligenceDec-15-2023

When performing tasks like automatic speech recognition or spoken language understanding for a given utterance, access to preceding text or audio provides contextual information can improve performance. Considering the recent advances in generative large language models (LLM), we hypothesize that an LLM could generate useful context information using the preceding text. With appropriate prompts, LLM could generate a prediction of the next sentence or abstractive text like titles or topics. In this paper, we study the use of LLM-generated context information and propose an approach to distill the generated information during fine-tuning of self-supervised speech models, which we refer to as generative context-aware fine-tuning. This approach allows the fine-tuned model to make improved predictions without access to the true surrounding segments or to the LLM at inference time, while requiring only a very small additional context module. We evaluate the proposed approach using the SLUE and Libri-light benchmarks for several downstream tasks: automatic speech recognition, named entity recognition, and sentiment analysis. The results show that generative context-aware fine-tuning outperforms a context injection fine-tuning approach that accesses the ground-truth previous text, and is competitive with a generative context injection fine-tuning approach that requires the LLM at inference time.

context information, information, injection, (14 more...)

arXiv.org Artificial Intelligence

2312.09895

Country:

Asia > South Korea > Gyeonggi-do > Suwon (0.05)
North America > United States > Indiana > Saint Joseph County > Mishawaka (0.04)
North America > United States > Texas > Potter County > Amarillo (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems

Davis, Ernest, Aaronson, Scott

arXiv.org Artificial IntelligenceAug-14-2023

Our test sets were too small and too haphazard to support statistically valid conclusions, but they were suggestive of a number of conclusions. We summarize these here, and discuss them at greater length in section 7. Over the kinds of problems tested, GPT-4 with either plug-in is significantly stronger than GPT-4 by itself, or, almost certainly, than any AI that existed a year ago. However it is still far from reliable; it often outputs a wrong answer or fails to output any answer. In terms of overall score, we would judge that these systems performs on the level of a middling undergraduate student. However, their capacities and weaknesses do not align with a human student; the systems solve some problems that even capable students would find challenging, whereas they fail on some problems that even middling high school students would find easy.

calculation, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2308.05713

Country:

North America > United States > Michigan (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > Quebec (0.04)
(40 more...)

Genre: Research Report (0.41)

Industry: Education > Educational Setting > K-12 Education > Secondary School (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Long-Range Transformers for Dynamic Spatiotemporal Forecasting

Grigsby, Jake, Wang, Zhe, Qi, Yanjun

arXiv.org Machine LearningSep-24-2021

Multivariate Time Series Forecasting (TSF) focuses on the prediction of future values based on historical context. In these problems, dependent variables provide additional information or early warning signs of changes in future behavior. State-of-the-art forecasting models rely on neural attention between timesteps. This allows for temporal learning but fails to consider distinct spatial relationships between variables. This paper addresses the problem by translating multivariate TSF into a novel spatiotemporal sequence formulation where each input token represents the value of a single variable at a given timestep. Long-Range Transformers can then learn interactions between space, time, and value information jointly along this extended sequence. Our method, which we call Spacetimeformer, scales to high dimensional forecasting problems dominated by Graph Neural Networks that rely on predefined variable graphs. We achieve competitive results on benchmarks from traffic forecasting to electricity demand and weather prediction while learning spatial and temporal relationships purely from data.

arxiv, sequence, transformer, (16 more...)

arXiv.org Machine Learning

2109.12218

Country:

North America > United States > Virginia (0.04)
North America > United States > Texas > Taylor County > Abilene (0.04)
North America > United States > Texas > Potter County > Amarillo (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Energy > Power Industry (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Annotator Rationales for Labeling Tasks in Crowdsourcing

Kutlu, Mucahid (TOBB University of Economics and Technology) | McDonnell, Tyler | Elsayed, Tamer (Qatar University) | Lease, Matthew (University of Texas at Austin)

Journal of Artificial Intelligence ResearchSep-23-2020

When collecting item ratings from human judges, it can be difficult to measure and enforce data quality due to task subjectivity and lack of transparency into how judges make each rating decision. To address this, we investigate asking judges to provide a specific form of rationale supporting each rating decision. We evaluate this approach on an information retrieval task in which human judges rate the relevance of Web pages for different search topics. Cost-benefit analysis over 10,000 judgments collected on Amazon's Mechanical Turk suggests a win-win. Firstly, rationales yield a multitude of benefits: more reliable judgments, greater transparency for evaluating both human raters and their judgments, reduced need for expert gold, the opportunity for dual-supervision from ratings and rationales, and added value from the rationales themselves. Secondly, once experienced in the task, crowd workers provide rationales with almost no increase in task completion time. Consequently, we can realize the above benefits with minimal additional cost.

information retrieval, machine learning, natural language, (20 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.12012

AI Access Foundation

12012

Journal of Artificial Intelligence Research

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Minnesota (0.04)
(9 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology (1.00)
Media (0.92)
Leisure & Entertainment (0.92)
(2 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback